Denoising DNA deep sequencing data—high-throughput sequencing errors and their correction
نویسندگان
چکیده
منابع مشابه
Denoising DNA deep sequencing data—high-throughput sequencing errors and their correction
Characterizing the errors generated by common high-throughput sequencing platforms and telling true genetic variation from technical artefacts are two interdependent steps, essential to many analyses such as single nucleotide variant calling, haplotype inference, sequence assembly and evolutionary studies. Both random and systematic errors can show a specific occurrence profile for each of the ...
متن کاملHigh-throughput DNA sequencing errors are reduced by orders of magnitude using circle sequencing.
A major limitation of high-throughput DNA sequencing is the high rate of erroneous base calls produced. For instance, Illumina sequencing machines produce errors at a rate of ~0.1-1 × 10(-2) per base sequenced. These technologies typically produce billions of base calls per experiment, translating to millions of errors. We have developed a unique library preparation strategy, "circle sequencing...
متن کاملUniversal Count Correction for High-Throughput Sequencing
We show that existing RNA-seq, DNase-seq, and ChIP-seq data exhibit overdispersed per-base read count distributions that are not matched to existing computational method assumptions. To compensate for this overdispersion we introduce a nonparametric and universal method for processing per-base sequencing read count data called FIXSEQ. We demonstrate that FIXSEQ substantially improves the perfor...
متن کاملA heuristic managing errors for DNA sequencing
MOTIVATION A new heuristic algorithm for solving DNA sequencing by hybridization problem with positive and negative errors. RESULTS A heuristic algorithm providing better solutions than algorithms known from the literature based on tabu search method.
متن کاملEvolutionary Approaches to DNA Sequencing with Errors
In the paper, two evolutionary approaches to the general DNA sequencing problem, assuming both negative and positive errors in the spectrum, are compared. The older of them is based on the idea of genetic approach and is enhanced by a greedy algorithm. The newly proposed algorithm combines the tabu search and the scatter search methods. After conducting experiments with random and coding DNA se...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Briefings in Bioinformatics
سال: 2015
ISSN: 1467-5463,1477-4054
DOI: 10.1093/bib/bbv029